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Abstract. We prove logarithmic Sobolev inequality for measures 

= dist(X") = exp(-y(a;")), G M", 

under the assumptions that: 

(i) the conditional distributions 

Qi{-\xj,3 ^i) = A\s\.{Xi\Xj = Xj,j ^ i) 

satisfy a logarithmic Sobolev inequality with a common constant p, and 

(ii) they also satisfy some condition expressing that the mixed partial derivatives of 
the Hamiltonian V are not too large relative to p. 

Condition (ii) has the form that the norms of some matrices defined in terms of 
the mixed partial derivatives of V do not exceed 1/2 ■ p ■ (1 — 5). The logarithmic 
Sobolev constant of q"' can then be estimated from below by 1/2- p-S. This improves 
on earlier results by Th. Bodineau and B. Helffer, by giving an explicit bound, for 
the logarithmic Sobolev constant for q"^ . 
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LOGARITHMIC SOBOLEV INEQUALITY 



1. Introduction. 

Let us consider the absolutely continuous probability measures on the n-dimensional 
Euclidean space R". With some abuse of notation, we use the same letter to denote 
a probability measure and its density function. 

If and are density functions on the same Euclidean space M"^, then D[p^\\q'^) 
and I{p'^\\q'^) will denote, respectively, the relative entropy (called also informa- 
tional divergence) and the Fisher information of p^ with respect to q'^ : 
If g^(a;^) = ^ p"(a;'^) = then 

D{p^q-) = j^Jog^^p-{x^)dx\ 

otherwise D{p^\\q^) = oo; if log ^tt^ is smooth a.e.. then 

p^^x^ 



I{p-\\q^) 



Vlog 



p{x"')dx' 



Definition. The density function q"^ satisfies a logarithmic Sobolev inequality with 
constant p > if for any density function p"' on M."', with log(p'^/g"^) smooth, 

Dip^q'^) < ^/(p"||g"). 



A logarithmic Sobolev inequality for q'^ can be used to control the rate of con- 
vergence to equilibrium for the diffusion process with limit distribution q'^, and 
is equivalent to the hypercontractivity of the associated semigroup. The proto- 
type is Gross' logarithmic Sobolev inequality for the Gaussian measures which is 
associated with the Ornstein-Uhlenbeck process [Gr]. Another use of logarithmic 
Sobolev inequalities is to derive transportation cost inequalities, a tool to prove 
measure concentration (F. Otto, C. Villani [0-V]). We shall return to the Otto- 
Villani theorem later in the introduction. The logarithmic Sobolev inequality for 
spin systems is equivalent to the property called "exponential decay of correlation" ; 
for this concept we refer to Stroock and Zegarlinski, [SZl], [SZ2], [SZ3], Zegarlinski 
[Z], Bodineau and Helffer [B-H] and HelflFer [H]. 

Our aim is to prove a logarithmic Sobolev inequality for measures with positive 
density 

= dist(X") = exp(-y(x")), x"" e M^, 
under the assumption that the conditional distributions 

Qii-\xjJ i^i) = dist(Xi|Xj = Xj,2 7^ ^) 

satisfy a logarithmic Sobolev inequality with a constant p independent of i and x", 
and also some condition expressing that the mixed partial derivatives of V are not 
too large relative to p. We want a logarithmic Sobolev constant independent of n. 
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Much work has been done on this subject. When the range of the X^'s is finite 
then another definition is needed for Fisher information, and thus for logarithmic 
Sobolev inequahty. Sec [D-H], [SZ2], [SZ3]. The case when the range of the X^'s is 
finite or compact was studied by J-D. Deuschel and D. W. HoUey [D-H], D. Stroock 
and B. Zegarhnski [S-Zl], [S-Z2], [S-Z3], L. Lu and H. T. Yau [L-Y] and others. 
In [S-Z3] Stroock and Zegarhnski prove equivalence between logarithmic Sobolev 
inequality and Dobrushin and Shlosman's strong mixing condition in the discrete 
case. The discrete or compact cases are simpler, basically because of the following 
reason: A random variable with finite range automatically admits a logarithmic 
Sobolev inequality with a constant depending only on the size of the range. The 
same holds for a one-dimensional compact range, but not for a one-dimensional 
non-compact range. 

The non-compact case was studied by N. Yoshida [Yl], [Y2], B. Helffer [He], B. 
Helffer and Th. Bodineau [Bod- He] and M. Ledoux [L]. Their results assert the 
existence of a positive logarithmic Sobolev constant under (more or less) the above 
conditions. But these results do not say much about how small the mixed partial 
derivatives of V should be cmpared to the logarithmic Sobolev constant of the 
Qi{-\xiys, and do not provide an explicit lower bound on the logarithmic Sobolev 
constant of q'^. Our aim is to improve on earlier results in this respect. 

We shall use the following 

Notation: 

• For a;" = {xi, X2, ■ ■ ■ x^) G and 1 < i < n, 

Xi = {xj : j ^ i), x' = {xj : j < i), x^ = {xj : i < j < n); 

• p'^ : density of an absolutely continuous probability measure on R"'; 

• = y"'(l) : random sequence in W^, dist(y^'^) =p"; 

• Pii'lVi) = dist(Yi|yj = Vi) <i < n) : conditional density functions 
consistent with p"; 

• pi:=dist{Yi), p' = dist{Y'), pf{-\y') = dist{Y;'\Y' = y'). 

• : density of a fixed absolutely continuous probability measure on W^; 

• X" : random sequence in M", dist(X") — q^; 

• Q = {Qi{xi\xi)^ <i e ]R e IR""^) : system of conditional 

density functions; usually Qi{-\xi) — dist(Xj|Xi = Xi) (1 < i < n). 

The following sufficient condition for logarithmic Sobolev inequalities is well 
known; it follows from the Bakry and Emery's celebrated criterion [Ba-E], supple- 
mented by a perturbation result by HoUey and Stroock [Ho-S]: 
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Theorem 0. Let q{x^) = exp(— y(a;"^)) be a density function on M'^, and let V be 
strictly convex at oo, i.e., V{x^) = U{x^) + K{x*'), where 

Hess(U){x'') = {dijU{x'')) >c-In 

for some c > (where In is the identity matrix), and K{x'^) is bounded. Then 
q satisfies a logarithmic Sobolev inequality with constant p, depending on c and 

Halloo.- 

p > c • exp(-4||K||oo)- 
In particular, if V is uniformly strictly convex, i.e., Hess{V) > c- In, then p > c. 



Note that S. Bobkov and F. Gotze [B-G] established a characterization of density 
functions on IR, satisfying a logarithmic Sobolev inequality. 

We mention also the very important fact that a product distribution admits a 
logarithmic Sobolev inequality with constant p, provided the factors have logarith- 
mic Sobolev constants > p. 

In particular, a product distribution where all factors are uniformly bounded 
perturbations of uniformly log-concave distributions, admits a logarithmic Sobolev 
inequality with a controllable constant. The simplest case beyond this is when 
all the Qj(-|Si)'s are uniformly bounded perturbations of uniformly log-concave 
distributions, but there is a weak dependence between the coordinates. This was 
the case investigated by N. Yoshida [Yl], [Y2], B. Helffer [He], B. Helffer and Th. 
Bodineau [Bod-He] and M. Ledoux [L], and this is the theme of the present paper. 

We proceed to the notations and definitions needed to formulate the main result. 

Let us be given a system of conditional density functions 

Q= {Qi{xi\xi), l<i<n, XiER XieW-^). (1.1) 

We consider Qi{xi\xi) as a function of n variables xi, X2, . . . , Xn, and di will denote 
partial derivative with respect to the z'th variable. Define the triangular function 
matrices Bi and B2 as follows: For y'^,z'^,ri'^ e put 

{y\ ry", z^) = d,,k (- log Q^{v^ \y'-\ ^D) , 

and define 

B^iy^,rj^,z^) = B^iy^ , rj^ , z^) = r7^ ^")),^,. 

If Q consists of the conditional density functions of g"(x") = exp{—V{x'^)) then 

since — logQi{xi\xi) and V{x^) differ in an additive term not depending on Xi. 
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Definition. Let us assume that the conditional density functions (1.1) satisfy the 
logarithmic Sobolev inequality with constant p independent of i and Xi. We say 
that the system Q in (1.1) satisfies the 1/2(1 — 5)-contracivity condition for partial 
derivatives if 



sup 



s,(2/^^^^") 



<^-(i-^), 



(C) 



We shall use the following short-hand 



Notation. 

A system Q of conditional density functions satisfies condition LSIk,C{p, 1/2(1 — 5)) 
if the Qi{-\xiys satisfy a logarithmic Sobolev inequality with constant p, and also 
the 1/2(1 — 5)-contracivity condition for partial derivatives. 

The aim of this paper is to prove the following 

Theorem. 

If the conditional density functions Qi{-\xi) satisfy condition 

LS'/&C(p, 1/2(1 — 5)) then there exists a unique density function q'^{x^) on M."^ 
such that its conditional density functions are the functions (1-1), and satisfies 
the logarithmic Sobolev inequality 

i^(p"||g") < ^/(p"||g"). (1.2) 

pd 

I.e., the logarithmic Sobolev constant of q^ is at least p5/2. 



This is clearly a perturbation result. 
Inequalty (1.2) can also be written in the form 

Dip'-m < 7, II MR |n ^(P"llg")- (1-3) 

p — 2 • supmax{||Si||, ||52||} 

Note that the L5'/&C(p, 1/2(1 — 6)) condition depends on the system of co- 
ordinates. This is a serious drawback, although in the case of spin systems this 
is natural. Moreover, the LS I SzC{p, 1/2(1 — 5)) condition also depends on the 
ordering of coordinates (see later). 

Because of the dependence on the system of coordinates, there are important 
families of distributions q'^{x'^) — exp(— ^(a:")) (with n growing) that admit a 
logarithmic Sobolev inequality with a constant independent of n, without satisfying 
the conditions of our Theorem. In fact, this is the case with many convex quadratic 
functions V{x"'), e.g., 

n n 

Vb(a;^) = 1/2-^ + 1/2- (^Xi-M) V const., MeM fixed. 
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In a paper on conservative spin systems Landim, Panizo and Yau [L-P-Y] proved 
logarithmic Sobolev inequality for the following class of densities exp{—V{x'^)): 

n 

where (/> : M i-^ M is bounded, and has bounded first and second derivatives. It 
would be nice to find a common way to prove our perturbative theorem and the 
theorem of Landim, Panizo and Yau [L-P-Y] , but so far these two directions could 
not be united, and in fact [L-P-Y] contains the only non-perturbative result for the 
non-compact case. 

The definition of the 1/2(1 — 5)-contracivity condition is not very transparent, 
partly because it does depend on the ordering of the index set. If the indices are 
nods in a lattice in a Euclidean space, and if V is sufl&ciently symmetric, then the 
following consideration may help. The definition of V can often be extended in a 
natural way to infinite sequences y = {yi) indexed by the nods of the entire lattice. 
Let us consider the lexicographical ordering on the nods (i.e., on the index set). For 
every nod i, the symmetry with center i is a bijection between the nods precedeing 
resp. following z, and it often happens that if nods j and k are interchanged by this 
bijection then 

where y* denotes the sequence defined as follows: 

Vi = Vii Vk — Vj if j ^ interchanged by the bijection. 

With this ordering of the indices it is often possible to give effective bounds for 
II -Bill and ||-B2||- Moreover, for the important special case when the mixed partial 
derivatives are constants (3i^k-, the matrices Si, B2 have numerical entries. Also, we 
can use the following estimates: Denoting ai^k = sup^.^ |5i,fcy(a;"')|, and writing 

M = {c^i,k)i<k, M = {ai,k)i>k, 

we have 

||S,|| < ||A,||, i = l,2. 

One of the basic tools in the proof of the Theorem is a result by Otto and Villani 
[0-V], establishing a connection between the logarithmic Sobolev inequality and a 
transportation cost inequality for quadratic Wasserstein distance. (A transporta- 
tion cost inequality bounds some distance between measures by the relative entropy 
of the measures.) 
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Definition. The (quadratic) Wasserstein distance, or VF-distance, between two 
probability measures and and (in W^) is: 

VI7(p", q^) = inf[£;^|r" - 1/2^ 

TT 

where and X" are random variables with laws resp. q^, and infimum is 
taken over all distributions tt on R"' x with marginals and q^. 

A transportation cost inequality for quadratic Wasserstein distance was first 
proved by M. Talagrand [T], in the case of Gaussian distributions. It was generalized 
by Otto and Villani, who revealed a deep connection between logarithmic Sobolev 
and transportation cost inequalities. 

Theorem of Otto and Villani. [0-Vj, [B-G-LJ If the density function q^{x'^) 
satisfies a logarithmic Sobolev inequality with constant p then 

W'^ip'^.q'') < - ■ Dip'^Wq'^). 
P 

This form of the theorem was proved in [B-G-L]; the original formulation in 
[0-V] contained some minor additional condition. We shall use this theorem in 
dimension 1, except for the proof of the Corollary below. 

It is not known whether a transportation cost inequality for VT-distance does 
imply a logarithmic Sobolev inequality; the conjecture is no. 

The importance of the Otto- Villani theorem for us lies in the following: By the 
LSI&lC{p, 1/2(1 — 5)) condition we can, roughly speaking, estimate 
D{Qi{-\xi)\\Qi{-\zi)) in terms of the Euclidean distance of the sequences Xi and Zi. 
By the Otto- Villani theorem this translates to a bound for W'^{Qi{-\xi),Qi{-\zi)). 
Thereby we derive from the LS'/&;C(p, 1/2(1 — 5)) condition an analog of Do- 
brushin's uniqueness condition [D]. This will assure a contractivity property for a 
Markov chain that we construct to interpolate between p"^ and q^. (See Section 5, 
Lemma 3). Such an interpolation was also used in [M2]. 

Related to such estimates is the following corollary which may be interesting on 
its own right: 

Corollary to the Theorem. // the conditional density functions of q^ satisfy 
condition LSI&iC{p, 1/2(1 — 6)) then for any i e [1, n], and any sequences x\ 

W'{qn-\x^),q?m) <^■D{qn^x^mm)<(^^^ -Ix^-xf . (1.4) 

I.e., the mapping x^ i— > (-ja;*) is Lipschitz with respect to the Euclidean and qua- 
dratic Wasserstein distances, with Lipschitz constant (1 — S)/S. 
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The proof of the Corollary (Section 5) will make it clear that for (1.4) we only 
need the Theorem forn — 1. This allows to use the Corollary in a proof by induction. 

In an earlier paper [M2, Theorem 2] we considered distributions satisfying 
conditions similar to LSI&:C{p, 1/2(1 — 6)), and proved a transportation cost in- 
equality. In view of the Otto-Villani theorem, this is weaker than a logarithmic 
Sobolev inequality. The contractivity condition for partial derivatives in the present 
paper is somewhat stronger than the one in Theorem 2 of [M2]. Note that in [M2] 
we also considered the more general case dealing with conditional density functions 
Qi{'\^i), where / runs over a collection of (small) subsets of [l,n], and Qi{-\xi) is 
the joint conditional density function of the random variables (Xi : i & I), given 
the values {Xj : j ^ I). We did not try to achieve this generality in the present 
paper. 

The proof of the Theorem is quite different from the approach taken by Bodineau 
and Helffer, who derived logarithmic Sobolev inequality from the exponential decay 
of correlations. Our proof is based on a discrete time interpolation connecting the 
distributions p"^ and q"'. This interpolation is more complicated than the Gibbs 
sampler used in [M2], and may seem somewhat artificial. Moreover, it does not 
seem applicable for stochastic simulation. However, we could not find any simpler 
interpolation doing the job. 

Even apart from the restriction of generality, the present paper does not make 
Theorem 2 of [M2] superfiuous. The proof of Theorem 2 in [M2], at least for 
the case considered in the present paper, is much simpler, and has more intrinsicly 
understandable meaning. Moreover, Theorem 1 in [M2] is not implied by the present 
result. 



2. An auxiliary theorem for estimating relative entropy. 

We shall use the following notation: If Y and X are random variables with values 
in the same space, and with distributions p and q, respectively, then we denote by 
iI>(y||X) the relative entropy D{p\\q). Moreover, if we are given the conditional 
distributions dist(y|y), dist(X|?7) and the joint distribution tt = dist{U,V), then 
we denote by -D(F|[/||X| V) the following average relative entropy: 

D(Y\V\\X\U) = E^D{Y\V = v\\X\U = u), 

where denotes expectation with respect to n. Also, we use the notation 

D{Y\V\\P{-\U)) = E^D{Y\V = v\\P{-\U = u)), 

where P{-\U) denotes a conditional distribution. 

In this section we prove the following 
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Auxiliary Theorem. Let X"^ he a random sequence with conditional density func- 
tions Qi{-\xi), and let {Y'^{t) : t = 1,2, . . .) he a discrete time random process in 
M". Then for any s > 1 

D{Y''{1)\\X'') 

s n 

t=i i=i 

+ L>(F"(s + l)||X'^). (2.1) 
In particular, if 

lim D{Y'^{s)\\X'^) ~ along some suhsequence (2-2) 

then 

L>(r"(i)||x'^) 

oo n 

t=i i=i (2.3) 



Remark. 

A frequently used tool in bounding relative entropy is the decomposition 

n 

D{Y^X^) = Y,D{Yi\Y'-^\\qi{-\Y'-^)), 

i=l 

where qi{-\y^~^) = dist(Xi|X*~^ = y^~^). This decomposition has the drawback 
that D{Yi\Y^~^\\qi{-\Y'^~^) cannot be easily bounded by a relative entropy with 
respect to Qi{-\Yi) or anything similar. The Auxiliary Theorem bounds D[Y'^\\X^) 
by an infinite sum of relative entropies, all with respect to some conditional density 
function of form Qi{-\-). 

Proof. 

First we prove (2.1) for s = 1: 

D(r"(i)||x") 

n 

<5^i^(y,(i)|y^-i(i),F-(2)||g,(-|y^-i(i),y,-(2))+i?(y-(2)||x-). 

i=i (2.4) 
To this end, we prove the following by recursion on i: 

n 

D(r'^(i)||x'^)< J]i?(r,(i)|r^-i(i),i^'^(2)||Q,(-|r^-i(i),i^-(2))) 
j=i 

+ D{{Y'-\l),Y,^_,i2))\\X^), z = l,2,...,n. (2.5) 
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For z = 1 this is just (2.4). 

To prove (2.5) for i = n, we use the expansion of relative entropy: 

L>(r'^(i)||x") = D(y'^-Hi)||x"-i) + i?(F„(i)|F"-\i)||Q^(-|r--Hi))) 

<i^(r„(l)|F-^(l)||g„(-|F-i(l))) + i;((F"-i(l),F,(2))||X"). (2.6) 
Applying (2.6) for D{Y''-\l),Yn{2)\\X''), we get 

D(y"-i(i),y„(2)||x^) 

<D(y^_i(i)|y^-2(i),y,(2)||Q._i(-|F--2(i),r,(2))) 

+ D(F-2(l),r,_i(2),F,(2)||X-). 
Substituting this into (2.6), we get (2.5) for z = n — 1: 

L»(y"(l)||X") < D{Ynil)\Y^-\l)\\Qni-\Y^-\m 

+ i;(y„_i(i)|y-2(i),yn(2)||Q„_i(-|y-2(i),y,(2))) 
+ D(y"-^(i),y„_i(2),y„(2)||x-). 

This procedure can be continued to get (2.5) for every z, thus proving (2.4). 
Applying (2.4) to D{Y''{2)\\X''), we get 
L»(y"(2)||X") 

n 

<5]L»(y,(2)|y^-i(2),y-(3)||Q,(-|y^-i(2),y-(3))) + i;(y"(3)||x"). 

Substituting this into (2.4), we get (2.1) for s = 2. It is clear that the same 
argument applied to D(y"(t)||X"'), will prove (2.1) for any s. □ 

The proof of the Auxiliary Theorem is fairly simple, but to use it we need a 
process (y"(t), t = 1,2,...) that admits good estimates for the terms in the sum 
(2.3), and also satisfies (2.2). The construction and analysis of such a process is 
the subject of the rest of the paper. 
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3. Preliminaries for the construction. 

We shall use the following 

Notation. Let (T, U, V) be a triple of random variables. We write 

to express the Markov relation 

T and V are conditionally independent given U. 

The following definition and lemma are crucial in the forthcoming construction. 

Definition. Consider a pair of random sequences (V^, S'^) in R"' x R"'. We say- 
that (y", S'^) (or its distribution) is loosely connected if for every i e [1, n] 

Yi^ {Y'-\S^) ^ S\ (3.1) 

Lemma 1. 

(i) For any joint distribution dist{Y^,S^), there exists a distiY"^ ^ S"^) such that, 
for every i G [l,n], we have dist{Y'^,S^) = distiY^^S^), and {Y^,S^) is loosely 
connected. 

(a) The joint distribution {Y'^,S'^) is loosely connected if and only if {S'^,Y'^), with 
the ordering of the indices reversed, is such, i.e., iff 

S, ^ {Y^-\ Sr) ^ Y,^.^, l<i<n. (3.2) 



Proof. 

(i): Define the joint distribution dist(yi, yS"^) by 

dist {S"" ) = dist (5'^ ) , dist ( Yi , ) = dist (Fi , ) and Yi ^ Si . 

Let us assume that we have already defined dist(y*~-^, 5'") in such a way that 
dist(,S^) = dist(,S"), and ioik<i-l 

dist(F^^^)=dist(r^5^) and %^(Y^-\S]:)^SK (3.3) 

We can extend this definition to a joint distribution dist(y*,S'") so that (3.3) is 
preserved for k < i. Indeed, we define dist(y*, S^^) = dist(y*, 5'f ), which is possible 
since dist(y*-\ S^) = dist(y*-S 5f ), and then make 

Yi {Y'-\S^) -^S\ 

So the proof can be completed by recursion. 
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(ii): It is clear that (3.1) implies 

Si^{Y'-\S^)^Yi. (3.4) 
Moreover, applying (3.1) for z + 1, 

(^i, Si-^i) (y, S'j^i) i^i+i, 

whence 

Si^{Y\S^)^Yi+^. (3.5) 
Putting together (3.4) and (3.5), we see that 

S,^{Y'-\S'l)^(Y,,Y,+r). 

Iterating this reasoning we arrive at (3.2). □ 

Definition. The random pair (K", S^) in Lemma 1 will be called the loosely con- 
nected copy of (F",^"). 

A basic step in the construction of the process (F"^(t)), to be used in the Auxiliary 
Theorem, is singled out in the next definition. 

Definition. Let us be given the system of conditional distributions Q as in (1.1), 
and a loosely connected joint distribution dist(y^, S"^). We extend dist(y^, S"^) to 

dist(y",S'",r/") (3.6) 

as follows. First we define, for every i, the conditional distribution dist(r7i|y"^, S"^). 
To this end, put 

disi{'ni\Y'-\s^) = QMy'-\s^). 

Then define 

^ist{Y,,'qi\Y'-\Sf) (3.7) 
as that joining of the conditional distributions 

dist(Fi|F*-\5f) and Qi{-\Y'-\ S^), (3.8) 

that achieves VF-distance for any given value of the conditions. Thereby we have 
defined disi{r]i\Y\ S^). Then put 

dist(r7i|y^, 5^) = disi{r^i\Y\ ,5^), 

and make the random variables rji, i = 1, . . . ,n, conditionally independent of each 
other, given (Y'^^S'^). 

The random triple (3.6) will be called the Q-extension of dist(y^, S'^). 
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4. The interpolation process and the Main Lemma. 

Let us be given a loosely connected joint distribution 



dist(r-(o),r-(i)), 



(4.1) 



and the system of conditional density functions Q (c.f. (l-l))- Starting from the 
distribution (4.1), we are going to construct a discrete time process 



for which then the Auxiliary Theorem will be applied to estimate DiY'^{l)\\X'^). 
The evolution of the process will be governed, in a tricky way, by the conditional 
density functions Qi{-\xi). The process (4.2) can be considered as an interpolation 
between F'^(l) and X". 

Actually we are going to build a process 



We want this process to satisfy the following requirements: 

(i) The sequence (F"^(t), t = 0, 1, 2, . . . ) is a Markov chain; 

(ii) the ?7"(t)'s are conditionally independent of each other, given the entire 



process (F"(t), t = 0, 1, . . . ); 

(iii) for t > 1, 

r{t + 1) ^ (y"(t - 1), Y^{t)) ^ (y"(s), s^t-l,s^t); 

(iv) Fort > 1, (F'^(t-l),F^(t),?7^(t+l)) is the Q-extension of (r^(t-l), r'^(t)); 

(v) for t > 2 and all i 

dist(y,(t + iWl'it + 1), y"(s), s < t) = dist(Fi(t + l)\Y'-^{t), Y^^{t + 1)) 
= dist(r7,(t + lM-\t),r^f{t + 1)). (4.4) 

For t = 1 we require only that (y"(l), y"(2)) be loosely connected, and that 
dist(y"(2)) = dist(r7"(2)). 

Requirement (v) implies that dist(y^(t), Y'^{t + 1)) is the loosely connected copy 
of dist(?7"^(t), ri'^{t + l)) (see (ii) of Lemma 1), in particular dist(y"(t)) = dist(?7'^(t)) 
for t > 2. 



{Y^{t):t = 0,1,2,...) 



(4.2) 



{{Y^{t-l),r{t + l)):t = l,2,...). 



(4.3) 



We construct the process (4.3) by induction on t. Let us assume that we have 
already defined 

dist((y"(0),y"(s),r7"(s + l)) ■.l<s<t) (4.5) 
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with properties (i-v). (For t = 1 this obviously can be done.) We extend (4.5) to 
dist((y"(0), F'^(s), r]''{s + l)) -.1 <s <t + l). (4.6) 
To this end, first we define 

dist(y"(s),s < t+ 1) (4.7) 

so as to satisfy (4.4). (We use the fact that dist(y"(t)) = dist(?7"'(t)), imphed by 
the induction hypothesis, and define recursively, for z = n, n — 1, . . . , 1, 

distiYiit + i)|y,"(t + 1), y-(s), s<t).) 

Having defined (4.7), we make ¥"^{1 + 1) conditionally independent of {r]'^{s),s < 
t+ 1), given (y"(s),s < t). Then we complete the definition of (4.6), extending 
(4.7) by 77"' (t + 2), so as to satisfy (ii), (iii) and (iv). The inductive construction is 
complete. 

Note that the joint distribution (4.1) and the conditional density functions (1.1) 
do not completely determine the process (Y^{t)), since condition (v) allows some 
freedom in the definition of dist(y'^(l), y"'(2)). 

In the rest of the paper (^(t)) = {V{t) : t = 0, 1, 2, . . . ) will always denote 
the process (4.2), i.e., a process built up as in this section, starting from a joint dis- 
tribution (4.1), governed by the conditional density functions (1.1), and satisfying 
properties (i-v). 

We are going to use the Auxiliary Theorem for the process (4.2). First we 
formulate an intermediate result. We shall use the following notation for the terms 
on the right-hand-side of (2.3): For t > we write 

n 

Dt = Y,D{m)\y'-\t),Y:^it+i)\m-\Y^-\t),Y:^it+i))). (4.8) 

Thus the sum on the right-hand-side of (2.3) is equal to -Dt- (Note that Dq 

is not included in the sum.) 

In Sections 5-7. we prove the following 

Main Lemma. If the system of conditional density functions Q satisfies condi- 
tion LSISzC{p, 1/2(1 — d)) then there exists a random sequence such that the 
conditional density functions of are the functions Qi, and 



The proof of the Main Lemma articulates as follows: After some preliminaries, 
in Section 6 we show that the sum in (2.3) is overbounded by the right-hand-side 
of (4.9). Then in Section 7 we prove the convergence (2.2). Finally, In Section 8 
we estimate Do and Di in terms of lip^Wq^)-, where = dist(y'^(l)). 
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5. Preliminaries for the proof of the Main Lemma. 

We shall need the following lemma: 

Lemma 2. 

For any joint distributions dist{U,V), dist{U,Y) and dist{V,X), 

D(Y\\X) < D(Y\U\\X\V). 



Proof. Write d7r{u, v) = dist(t/, V). Define dist{Y\U ^ u,V ^ v) ^ dist{Y\U = u), 
and dist(X|t/ = u,V = v) = dist(X|F = v). Then 

dist(y) = J dist{Y\U = u,V = v)d7r{u,v), 



and 



dist(X) = J dist{X\U = u,V = v)d7r{u,v). 



It is known that i3>(r (r, s probability distributions) is a convex function of the 
pair (r, s). Therefore, the above representations imply 



D(Y\\X)< D{Y\U = u,V = v\\X\U = u,V = v)d7r{u,v) = D(Y\U\\X\V). □ 



The analysis of the process (4.2) is based on Lemmas 3 and 4 below. 
Lemma 3. 

Assume that the conditional density functions Qi{-\xi) satisfy condition 
LSmC{p,l/2{l-5)). Then: 

(i) For any quintuple of sequences («"(!), t(."(2), t"^(l), t"(2), C"^) ; 



i=l 



Q,(0^-Hl),<(2)) 



Q,(0|t-i(l),tF(2)) 



|w'^(l) - r(l)p + |w"(2) - r(2)p 



(5.1) 



If u^{l) = t'^{l) then the right-hand-side can be divided by 2. 

(a) For any quadruple of random sequences (t/"^(l), [/"^(2), T"^(l), T"'(2)) . 



Y,D{Q,{^U^-\l),Ur{2))\m.\T^-\l),Tl^{2))) 



< 1/4-p- (1 -5)2 -E 



C/^(l) - r"(l)|2 + |C/"(2) - T^(2)| 



(5.2) 
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Consequently, 

n 

Ej2w^{Qii-\U'-\l),Un'2)),Qii-\T'-\l),Tn2))) 



< l/2-{l-5f -E 



- T"(l)|2 + |t/^(2) - T^(2)| 



n/oM2 



(5.3) 



For U'^{1) = T'^(l) the right-hand-side can be divided by 2. 
Proof. 

(i): We have 



i=l 



5, log 



g,(a|w^-^(i),<(2)) 



(5,(C.|t^-Hi),tr(2)) 



n 



a,logQi(0k'-'(l),<(2))-aaogQi(0|t*-'(l),<(2)) 



i-l/ 



c»,iogQi(Ci|t^-'(i),<(2))-aaogg.(Ci|t^-'(i),tr(2)) 



i-l. 



(5.4) 



To estimate the first term, fix the vectors and tt" (2) , and consider the mapping 

g = giy'^) : ^ R", 

defined by 

g^iy'^) = log Q^{C^\y'-\ <(2)), i = 1, 2, . . . , n. 

The first sum on the right-hand-side of (5.4) is just the squared Euclidean norm of 
the increment of g between points t"(l), tt"(l). By the contr activity condition for 
partial derivatives, ((C) of Section 1), the norm of the Jacobian of g is bounded by 
1/2 ■ p ■ {1 — 5), so the first sum in (5.4) can be bounded by 

l/4-p2.(l_5)2.|^n(i)_^n(i)|2_ 

A similar bound holds for the second term. Inequality (5.1) is proved. The second 
statement of (i) is obvious. 

(ii): Since the conditional density functions Qi{-\xi) satisfy the logarithmic 
Sobolev inequality with constant p, we have 



Y,D{Qi{^u'-\i),unmQi{-\T'-\^),Tnm 



i=l 



n 



1 



^E^-^ / QiiCi\u'-\i),ur{2)) 



dilog 



Q,{Q\W-\l),Uri2)) 
Q,{QT^-^{1),TI^{2)) 



dCi 



2p 



E 



n 



E 

i=l 



dilog 



Q,(0|^--Hl),^r(2)) 

g,(G|Ti-ni),7;"(2)) 



2 n 



YlQ^{C^\U'-\l),Ul^{2))dC 



(5.5) 
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Since the product in (5.5) is a density function, (5.1) implies (5.2). To get (5.3) 
from (5.2), apply the Otto-Villani theorem to each Qj(-|t*-^(l), t"(2)). □ 



Proof of the Corollary to the Theorem. 

Assume that the Theorem holds for n — 1. Fix i, and x*. By the Otto-Villani 
theorem 

W'{qn-\x'),qn-\x')) < ^ ■ D{qn-\x')\\qn-\x')) 

n 



2^' 



pd 



k=i+l 



Now we use the argument of the proof of Lemma 3. Fix and consider the 
mapping 



defined as 

gk{x^)=dklogqnax'). i<k<n. 

The sum within the last integral is the increment of g between points and x*. 
The Jacobian of g{x'^) is the matrix 

{d^,k log q^{x\ , (1 < j < ^, ^ < A; < n), 

i.e., a minor of — i?i(x"^, x"^, C")- Therefore, by the Contractivity Condition, the 
norm of the Jacobian of g{x'^) is at most 1/2 • p • (1 — 5). Thus 



E \dk\ogq^{QW)-dk\ogq^{Q\x^) \^<l/A. {1 - 5f ■ p 

fc=i+l 

and (1.4) follows. □ 



2 'x^-xi^, 



We shall need the Markov operator defined by the system of conditional density 
functions Q as follows. 

Definition. 

Let G be a Markov operator on the probability measures tt"^ (on M") whose tran- 
sition function is 

n 

G{v^\u^)^\[Qi{v,y-\u^). 

Note that if a density function has conditional density functions Qi then it is 
invariant with respect to G. 
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Lemma 4. 

(i) If dist{U'^) and distiV"^) have a joining dist{U'^,V'^) such that 

distiVilV'-^Ul") = Qi{-\V'-\ C/f ), l<i<n, (5.6) 
then distiV) = dist{U'')G. 

(ii) If the system of conditional density functions Q satisfies condition LSISzC{p, 1/2(1 — 
S)), then for any two distributions tv^ and on WP- , 

;y^((7^G',7r"G') < (1 - 5)2^/2 (c7",7r^), 

i.e., G is a strict contraction with respect to W -distance. Consequently, forO there 
exists a unique invariant distribution q^. 

Proof. 

(i) We show that dist(t/"') and relations (5.6) uniquely determine dist(y^). Indeed, 

it is clear that dist(C/f , V^i) is well-defined by dist(C/"') and (5.6). Then wc sec that 
dist(t/^, Fi, V2) is well-defined, too, and, by recursion, dist(C/", V*) is well-defined 
for all i. For i = n we get that dist(y^) is well-defined. 

(n) Write 

a" = dist(t/"(2)), (7"G = dist(t/^(l)) 

7r" = dist(T^(2)), 7r^G = dist(T^(l)). 

Let us be given a joint distribution for (?7"(2), T'^{2)). We inductively build up a 
joint distribution for the quadruple ([/"(I), T"(l), t/^(2), T"(2)) as follows. Let us 
assume that, for some i, 1 < i < n, we have already defined 

dist(C/^-^(l),T^-^(l),C/"(2),T"(2)). 

Then put 

dist(C/,(l)|C/^-i(l),T^-i(l),C/-(2),T-(2)) =g,(-|t/^-i(l),C/-(2)), 
dist(T,(l)|t/^-i(l),r-i(l),C/-(2),T-(2))=Q,(.|r-i(l),Tf(2)), 

(5.7) 

and define 

dist(t/,(l),T,(l)|[/^-i(l),T^-i(l),t/^(2),T^(2)) 

as that joining of the conditional distributions (5.7) that achieves VF-distance for 
every value of the conditions. By (5.3) of Lemma 3, 

n . 

E\U^{1) - T^{1)\^ = Y,^W^[Qi{W~\^),Ur{2)),Qi{^T'-\l),Tp{2^ 

i=i ^ 

C/"(l) -T"(l)|2 + |t/"(2) -T^(2)|2 . 
Rearranging terms we get 

< (1 -5)2 ■ |^7"(2) -T"(2)f. □ 



< 1/2- (1 -5)2 -E 
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6. Estimating the sum in (2.3). 

The sum in the right-hand-side of (2.3) is -Dt, where 

n 

D, = Y,D{Y,{t)\Y^-\t),Y^{t^-l)\\QMY'-\t),Y:^{t + l))). 

i=l 

In this section we estimate Df in terms of Dq and Di. 

Lemma 5. Assume that the conditional density functions Qi{-\xi) satisfy condition 
LSIkC{p, 1/2(1 - S)). Then for the the process (4.2) 

Proof. It is enough to prove the recursive bound 

Dt < 1/2 ■ (1 - 5)2 . {Dt_2 + Dt-i), [t > 2), (6.2) 

since (6.2) will imply (6.1) by a simple calculation. 

Let us estimate the z'th term in the definition of Dt- We claim that for t > 2 

D{Y,it)\Y^-\t), Y^it + l)\m-\Y^-Ht), Y^t + 1))) 
< D{Q,{.\Y'-\t - 2), Y^{t - l))\\Q,W-\t\ + I)))- (6.3) 

To prove (6.3), recall that, by Lemma 1 and property (v) of the process (y"(t), ?7"^(t)), 
the joint distribution dist(y"'(t), Y'^{t+1)) is the loosely connected copy of dist(?7"'(t), rj" 
1)), thus 

D{Y,{t)\Y^-\t),Y^{t + l)\m-\Y^-\t),Y^{t + 1))) 
= i?(r7,(t)|r7^-i(t),r7r(t + l)||Q^(-|r7^-^(t),r7r(t + l))). (6.4) 
By Lemma 2 the right-hand-side can be bounded as follows: 

<i?(r7.(t)|r7^-i(0,r7r(t + l),l^'-'(^-2),i;"(t-l)||Q.(-|r7^-^(t),r7r(i + l))). (6.5) 
To handle (6.5), we need the following relation of conditional independence: 

r7,(t) ^ iY^-\t - 2), Y^^it - 1)) ^ (r?^-i(t), <(t + 1)). (6.6) 
Assume for a moment that (6.6) is proved. Since by definition 

dist{vmiY'-\t - 2), y-(t - 1)) = Q.{-\Y^-\t - 2), i;"(t - 1)), (6.7) 
(6.6) implies that the right-hand-side of (6.5) is just 

D{Q,i.\Y'-\t-2),Y-it-l))\\Q,W-\t),vnt + m^ 



20 



LOGARITHMIC SOBOLEV INEQUALITY 



and (6.3) follows. 

So let us prove (6.6). By construction r]i{t) is conditionally independent of all the 
process (y"(s), ri'^{s)) given {Y^{t — 2),Y^{t — 1)). (C.f. property (iv) in Section 4 
and the definition of Q-extension in Section 3.) Thus (6.6) will be proved as soon 
as we show that 

Y,{t - 2) ^ {Y'-\t - 2),Y-{t - 1)) ^ {rj'-\t), r/^t + l))- (6.8) 

We know that (F"(t — 2),Y'^{t — 1)) is loosely connected, i.e., 

y.(t _ 2) ^ {Y'-\t - 2), Y,^{t - 1)) ^ Y\t - 1). 

So for the proof of (6.8) it is enough to show that 

Y,{t - 2) ^ {Y^-\t - 2), Y-{t - 1)) ^ {v'-\t),rjnt + 1)). (6.9) 

This follows from the relations 

rf~^{t) is conditionally independent of 
everything, given {Y'-'^{t - 2), y"(t - 1)), 

rf'{t is conditionally independent of 

everything, given (^ (t - 1), ^ (t)). 



and 



y"(t - 2) ^ y"(t - 1) ^ y"(t). 

Thus (6.8), and consequenyly (6.6) and (6.3) are proved. 
By (6.3) and Lemma 3 we have for t > 2 

n 

Dt<Y.D{QMY'-\t-2),Yl^{t-l))\\QMri'-\t),r^:{t + l))) 



<1/A-(1-Sy -p-E 



|y"(t - 2) - r7"(t)|^ + |y"(t - 1) - r7"(t + 1)|^ 



(6.10) 



To bound the right-hand-side of (6.10) by Dt_2 and Dt_i, recall that 
E{\Y,{t - 2) - r7.(t)|2 I Y^-\t - 2), y,"(t - 1)} 

= Ew' (dxst{Y,{t - 2)\Y^-\t - 2), y-(t - 1)), g,(-|y'-^(t - 2), Y:^{t))) . 
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Therefore, by the Otto-Villani theorem, 

E\Y^{t - 2) - rj^{t)\' = f^^HlUt - 2) - Vi{t)\^ \ y'~\t - 2), - 
2 " 

< - ■ Y.D{Ut - 2)\Y^-\t - 2), Y^t - l)\m-\Y^-\t - 2),Yr{t - 1))) 

^ i=l 

= --A-2. (6.11) 

p 

Inequahties (6.10-6.11) imply (6.2) and hence (6.1). □ 

Remark. By the definition of VF-distance and by (6.11) 

W^2(y"(t),y"(t + 2)) < E|y"(t-2) -r/^(t)p < -■Dt-2, 

p 

thus 

W^2(y^(t - 2), y^(t)) ^ exponentially fast. (6.12) 

Wc shall need this to prove that (y"'(2t — 1), ^^(2^)) converges, along some subse- 
quence, in VF-distance. 
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7. Convergence properties and proof of the Main Lemma 

Before we proceed to the proof of the Main Lemma , we estabhsh Wasserstein- 
convergence for the process (y"(t)). 

Lemma 6. If the conditional density functions Qi{-\xi) satisfy condition 
LSI&lC{p, 1/2(1 — 5)) then there exists a joint distribution dist{X'^ , Z"^) such that, 
along some subsequence of the integers t, 

lim (r''(t), ¥""{1 + 1)) = (X", Z^) in W-distance. (7.1) 

Moreover, 

dist{X\ Z2) = q"" for all i. (7.2) 

Proof. We claim that there exists a joint distribution dist(X"^, Z'^.T^) such that 

{Y^{t - 1), rj^it + 1), Y^{t), rj^it + 2),Y-{t+ 1), r,^{t + 3)) 

^ (X^,X",Z^,Z^,T",T") in W^-distance, (7.3) 

along some subsequence of the values of t. Indeed, the sequences {Y'^{2t)) and 
(y"'(2t +1)) being Cauchy with respect to VF-distance, the process in (7.1) is 
uniformly square integrable, hence tight, hence compact with respect to weak con- 
vergence, and hence also for VT-convergence. Since 

lim E|y'"(t-2) -?7^(t)|2 = 0, 

t— »oo 

(7.3) follows. 

To prove (7.2), we deduce from (7.3) the following relations: for all i, 

dist{Xi\X'-\Z^) = Qi{.\X'-\Z^), dist{Zi\Z'-\Tn = Qi{-\Z'-\Tn, 

(7.4) 

and 

dist(X'^) = dist(T'^). (7.5) 

The latter is clear, since limt^oo W^{Y''{t), F^(t + 2)) = 0. The first line of (7.4) 
follows from the relations 

{Y'-\t - l),r)i{t + l),Y^''{t)) {X\Z^) in distribution, 

and 

dist{rji{t + i)\Y'-\t-i),Y:'{t)) = Q,(.|y^-i(t- i),y-(t)). 

The second line can be proved similarly. 
By Lemma 4, (7.4) implies that 

dist(X^) = dist(Z^)G', and dist(Z") = dist(T^)G'. 
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By (7.5) this means that dist(X"^) = dist(Z'"^) is invariant with respect to G, thus by- 
Lemma 4 it must be (f- . This, together with (7.4), imhes (7.2). □ 

Proof of the Main Lemma. 

By the Auxihary Theorem, it is enough to prove that 

hm D{Y^{t)\\X'^) = along some subsequence. (7.6) 

t— »oo 

Since dist(y (t)) = dist(77"(t)), (7.6) is equivalent to 

lim D{r]'^{t)\\X'^) = along some subsequence. 

t—KX) 

We shall move along a subsequence such that 

lim (y^(t - 2), y^(t - 1)) = (X", Z"") in W^-distance. 

t— »oo 

By Lemma 2 and relations (6.6-6.7) we have 

n 

n 

< J]i?(r7,(t)|r7^-\t),y^-i(t-2),F-(t-l))||Q,(-|r7^-i(0) 

i=l 

n 

= J2D{Q,{.\Y^-\t - 2), Y-it - l))U{-\rj^-\t)), (7.7) 
1=1 

where qi{-\y'~^) = dist{Xi\X^-^ = y^'^). 

We are going to replace {t)) by Qi(-|X*~^), by the cost of a small error 

term. This requires a tedious approximation argument which we carry out in the 
Appendix, proving the following lemma. 

Lemma 7. Let dist{X^) = be fixed, and assume that its conditional density 
functions Qk{-\xk) satisfy condition LSI&6C{p, 1/2(1 — d)). Fix an s > 0. There 
exists a 7 > such that for any i and any joint distribution dist{Yi,rj'^~^ , X'^) we 
have the implication 

|E|Fi < 72 & E|r/*-^ < 7^1 
=> D{Q,{-mh{-\r^^-')) < D{Q,{-m\q,{-\X^-')) + e. (7.8) 

We give the proof in the Appendix, and apply the lemma now. 
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Fix an £ > 0, and find the corresponding 7 > 0. Select t so large that 
^2((yn(t _ 2), y"(t - 1)), (X^, Z^)) < 7^/4, 



and 



E|?7"(t) -r"(t-2)| <7V4, 



where (X"^, Z"^) is defined in Lemma 6. Define the joint distribution dist(y"'(t — 
2), - 1), X'^, Z") to achieve 

E|(F'^(t - 2), - 1)) - (X", Z")H < 774; 

this also ensures E|?7*~^(t) - < 7^ for all z. 

Wc apply Lemma 7 with — 2),Y[^{t — 1)) in place of Yi, with ri'^~^{t) in 

place of ?7*~^ and with ZJ^) in place of X'^. Using also Lemma 2 and (7.2) of 
Lemma 6: 

D{QMY'-\t-2),Ynt-mhiW-')) 

< D{Q,{-\Y^-\t - 2), - 1))U{-\X^-') + e 

< D{QMY'-\t - 2), - l))\\QMX'-\ ZD + e. (7.9) 

Now we can apply Lemma 3 with (y"(t-2), y"(t-l)) in place of ([/""(l), ?7"(2)), 
and with (X",Z") in place of (T"(l), T"(2)). We get 



D{QMY'-\t - 2), Yt{t - l))\\QMX'-\ Z-)) 



i=l 



< 1/4(1- 5)2. p 



\Y'^{t - 2) - X^l^ + |y"(t - 1) - Z 



7n\2 



(7.10) 



Inequalities (7.7), (7.9) and (7.10) imply 

D(F"(t)||X") < L'(?7"(t)||X") < £ + 7^ 



The Main Lemma is proved. 



□ 
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8. Bounds by the Fisher information: end of the proof 

To derive the Theorem from the Main Lemma, we speciahze the process 
taking for dist(y"(0), y^(l)) the loosely connected copy of (y"(0), y^(0)), where 
dist(y^(0)) =p". Thus, in particular, 

dist(y*(0), Yl'{l)) = p'^ l<i<n. 

The joint distribution dist(y"'(0), y"'(l)) does determine 
dist(y"(0), y"(l), r7"(2)), but we are free to select dist(y"(l), y^(2)) subject to the 
conditions that it be loosely connected, and dist(y"'(2)) = dist(?7"'(2)). We define 
dist(y"(l),y"(2)) as follows. Let us extend dist(y (0), ^^(1), ?7''(2)) to a joint 
distribution dist(r7"(l),y"(0),y"(l),r/"(2)) in such a way that dist(r;"(l), y"(0)), 
too, be the loosely connected copy of (y'^(O), y'^(O)) as well, and the following 
Markov relations hold: 

77^(1) ^ y'^(o) ^ y"(i), ?7"(i) ^ (^(0), y'^(i)) ^ 77^(2). (8.1) 

Then let dist(y"(l), y"(2)) be the loosely connected copy of (r/"(l), r7"(2)). 

For this case we can estimate Dq and Di in terms of For Dq this is 

easy. Since dist(y*(0), = for all i, we have 

n 

D, = Y,D{Ymy'-\0),Y;^{l)\\Q,{.\Y^-\0),Y;^{l))^ 

n 

= j2D{YmYm\Qimm)- 

1=1 

Applying the logarithmic Sobolev inequality for Qi{-\Yi{0)), we get that 

Do < ^Hp^qn- (8.2) 
2p 

The estimate for Di is more complicated. By the definition of dist(y'^(l), y'^(2)): 

n 

D, = J2DiY^{l)\Y^-\l),Y:^i2)\m.\Y^-\l),Y:^i2))) 
1=1 

n 

= 5]i?(r7,(l)|r7^-i(l),r7r(2)||g.(-|r7^-^(l),r7n2))). (8.3) 

i=l 



To overbound the terms in the last line of (8.3) we need the Markov relation 

Vi{l)^{v'-\l),YriO))^Vn'2)- (8.4) 



26 



LOGARITHMIC SOBOLEV INEQUALITY 



By the loose connection of {r]^{l),Y^{0)), for (8.4) it is enough to prove that 



or 



r/-(l)^y-(0)^r7"(2). 
This follows from the definition of ri'^{l) (see (8.1)). 

Using (8.4), Lemma 3 and the equality dist(77*(l), Y-^{0)) = p'^ we get 

D{ij,{i)\ij^-\i),ijn2)\m-\v'-\i),v?m) 
<D{l^,{lM-\l),Y:^mQ^(■\v'-\l),v?m) 

= ^(p^(-|^^-'(l),>^.nO))||Q^(-|r7^-'(l),r7r(2))). 
By the logarithmic Sobolev inequality, this implies 

< ^ .E/(p,(.|r/-i(l),l^-(0))||Q,(-K-^(l),r/n2))). 



Substituting this into (8.3) we get 

^ n „ 



diXog 



p,{y,\l^^-\l),Y^m 



Q,(y,|r/-Hl),^r)(2) 



Using the relation dist(?7^-^(l), y,"(0)) = pi and the bound (a + 6)^ < 2a^ + 21?, 
the last line can be continued as follows: 



1 



Di<2 Ve 



diXog 



P 



Qiivilvi) 



diXog 



dyi 



Q,(y,|r7^-i(l),y-(0)) 



Q,(y,|r/-Hl),^I^(2)) 



(8.5) 



The first term on the right-hand-side of (8.5) is ^I{p'^\\q'^). The second term 
can be written as 

2 n 

l[p,{u,\rj'-\l),Y,-{0))du\ 
1=1 

By Lemma 3 for any u'^ e we have the inequality 

2 

< 1/4 • p • (1 - 5f ■ |r"(0) - ?7"(2)|2, 



•E 



1 



dilog 







Qi{ui\v'~ 


-Hi),^r(2)) 



E 

^- 1 



^ilog 



Qi{uiW 




Qi{ui\v'~ 


-Hi),^r(2)) 
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SO the second term in the right-hand-side of (8.5) is bounded by 

1/4 • p • (1 - 5)2 . E|y"(0) - r/"(2)|^ (8.6) 

By the definition of dist(r^(0), y"(l), 77^(2)) and by the Otto-Villani theorem 
(8.6) can be continued as follows: 

<l/A.p.{l-5f-Y^ E|{E|y,(0) - r7,(2)ny^-i(0), 1^-(1)}| 



< 



1/2 • (1 - df ■ J2D{Y^mY^-\o), yr(i)||Q.(-|y^-\o), y-(i))) 



< 1/4- (1 -5)2 . 1 ./(p'^llg'^). (8.7) 
P 

From (8.5-8.7) we get the estimate 

Di < -[1 + 1/4(1-5)2] ./(p^llg'^). 

p 

Since Dq < ^ ■ /(p"||q'"), the Main Lemma yields 

This completes the proof of the Theorem. □ 
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Appendix 



Proof of Lemma 7. 

For the joint distribution dist(Fj, ?7*~^, X'^) consider the conditonal relative entropy 

D{Q,{-\Yi)M-\v'~')) = / i?(Q.(■|yOlk^(•|^^"'))^^^(^^,^'"'), (A.l) 

where integration is with respect to dist(Fi, ry*"^). Our first aim is to restrict 
integration in (A.l) to a smaUer set, by the cost of a small error term. This smaller 
set will be compact, and will have other useful properties. 

First we prove, for fixed yi and z^~^, the following estimate which allows to 
control the truncation by L2-techniques: 



D{Qi{-\y,)\\qi{-\z^-')) 



(A.2) 



To prove (A.2), let us introduce an auxiliary random variable S^, putting 

dist{S^\rj'-\Y^ = qnW-'), (A.3) 

where q^{-\x'-^) = dist{XJ\X'-'^ = x''^). Let {Yi,r]i-i) X"". Thereby 

we have defined dist(X'^, Fj, ?7^-^ S^). 



By Lemma 2 we have 

D[QMmQ^{¥~^))<Dm■\yi)\mv~\s:)), 



(A.4) 



where dist(,Sf ) = q^{-\z'-^) 



By the logarithmic Sobolev inequality for Qi{-\z^ ^,3^^), the right- hand-side of 
(A.4) can be estimated as follows: 

D{Q,{-\y,)\m-\^'-\s:^)) < ^ / i{Q,{-mmv-\sf))-qns7\z'-')ds2 

QMyi)\diViy'-\v,y?) - diViz'-\r],s^)\^dr] 



1 

2p 



qns:\z'-')ds2. 

(A.5) 



By Lemma 3 we have the following bound for the inner integral 

|2 



/ QMyi)\d,Viy'-\v,y:)-d,Viz'-\v,s^) 
Jr 



drj 



.i— 1 1|2 I L,n „n|2 



+ m - 



(A.6) 
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Estimates (A.5-A.6) imply (A. 2) 



To exploit (A. 2), we need bounds for the integrals of \Yi\ , \rj^ \ and \S^\ over 
sets of small probability. 

Assume 

E|?7^-i < £2 and E\Yi - Xi\'^ < e'^ . 
Then for any set E, measurable with respect to {X"^, Yi, r]^~^, S^), we have 

■Xe}<2- E{\X,f ■ xe} + 2£^ 
E{|r7^-T ■Xe}<2- E{|X^-1|2 • xe} + 2e\ (A.7) 
To handle [(Sf we recall the Corollary to the Theorem (1.2), saying that 

W'^{S'l,X]^) <C^ ■W'^{r]'-^,X^-^) <C2-£2. (A.8) 

where C is a constant depending on 5. (Recall that for this we only need the 
Theorem for n — 1.) By (A.8), 

H\S7?-Xe] <2-E{|Xf|2.Xi?} + 2C2-£2. (A.9) 

Let i^T C M be a compact interval such that the integral of outside of 

is less than . Then, by (A.2), (A.7) and (A.9) 

D{Q,{-\Yi)\\qMri'-^)) 

< [ D(g,(-|y,)lki(-|^'"')M7r(y,,^^-i) + 0(e2), 

^K"-ixK*-i (A. 10) 

where 0(£^) is bounded by an absolute constant times • 

Further select 7 > so small that for any m and any measure /j, on x 
whose marginal for the first n coordinates is q'^, we have 

EgW X W^, ii{E) < 7 =^ E{|X'^|2 . xe] < e^. 

Assume 

EIV"^ -X^-^ < 7^, 



and define 

B 

We have then 



Pr{{r]^-^,X'-^) ^ 5} < 7. (A.ll) 
Let M = (ii:"-i X K^-i X K^-i) n B. Then (A.IO) can be continued as follows: 

D{QMYi)\\q^{■W-^)) < [ i?(gi(-|yi)||gi(-|^^-^))(i//(yi,^'-\x^-^) + 0(£2), 

Jm 

(A.12) 

where = dist(Fj,?7* ^). (The integrand does not depend on ^, but the 

domain of integration does.) Thus in estimating D{Qi{-\Yi\\qi{-\ri'^~^)), integration 
can be restricted to those values of {yi, z'^~^ ,x'^~^), for which all coordinates lie in 
K, and \z'^~-^ — a;*~-^| < 7. 

To overbound the integral (A.12) we are going to use the following easy-to- prove 
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Facts. 

Let f{x,y) : X R'^ I— > R 6e a continuous function such that f{x, •) is integrable 
for every x. Define 



9l{x) = J f{x,y)dy, L 



c R^ 



Then: 

(i) For every compact L g'MJ' giix) is continuous; 

(a) for every compact M C R* there exists a compact L gM.^ such that 

\g^k {x) - gL{x) <e, x e M. 

It easily follows from these Facts that q'*(a;*) = J^n-i q^{x'^)dx'^ is continuous in 
x*, and therefore so is qi{xi\x'^~^). 

Consider the integrand in (A. 12): 

D{Q^{■\yi)\\<l^{V)) - [ Q^m)log-^/^J^^dr]. (A.13) 

Jr QiiW' '■) 

By Fact (ii), for (y^, z^~^) e K'^~^ x K'^~^ we can replace the domain of integration 
R by [—A, A], with some finite A, depending on K: 



D{Q,{-\y,)\\q,{-\z^-')) - r Qiivmog-^f^^dji 

J— A QiKVl^ ) 



< £ on K^-^ X K'-^, 



(A.14) 

for A large enough. By Fact (i), the integral in (A.14) is uniformly continuous on 

Make on 7 > the additional constraint that the integral in (A.14) changes by 
less than s if we replace z^~^ e K'^"^ by x'^~^ e K^~^, provided \z'^~'^ — a;*~^| < 7: 



QMvi) log -T^^TT^^ - / ^^(^1^*) - 

qi{v\z' 1) J-A Qi 



QiHVi) 



i{r]\x'' 1) 

for (|/'^-^2^-^a;'-^) e K"-^ x K^'^ x K^'^ , \z'-^ - x'-^\ < 7. 



< e (A.15) 



Intcgating (A.15) over M, and taking into account the definition of the set B 
and the bound (A.14): 

/ i^(gi(-|yi)l|g»(-l^^-^))dMyi,^^"',^'"') 

< [ i?((5.(-|^.)|k.(-|a;^-i))dM^i,^'-\x^-^) + 0(£). 
Jm 

Substituting this into (A. 12) we get 

^(Q^(■|>^^)lki(■|^^~')) <^(Q^(■|>^^)lki(■|^^-'))+0(£). □ 
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